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GENE CLUSTER 

BACKGROUND OF THE INVENTION 

5 Polyketides constitute a large and highly diverse group of secondary 

metabolites synthesized by bacteria, fungi and plants, with a broad range of biological 
activities and medical applications. They include anti-cancer agents (Daunorubicin), 
antibiotics (tetracyclines, erythromycin etc.), immunosuppressants (macrolide FK506) 
■,f^ and compounds with mycotoxic activity (aflatoxins, ochratoxins, ergochromes, 

Tl 

I-^ 10 patuiin etc.). Polyketides are synthesized by repetitive condensations of acetate or 

n 

propionate monomers in a similar way to that of fatty acid biosynthesis. Structural 

IP 

diversity of polyketides is achieved through different thioester primers, varying chain 
extension units used by the polyketide synthases (PKSs), and variations in the 

13 stereochemistry and the degree of reduction of intermediates. Diversity is also 

9 

Q 15 achieved by subsequent processing, such as alkylations, oxidations, O-methylations, 
glycosylations and cyclizations. Genetic studies indicated that gene organization of 
functional units and motif pattems of various PKSs are similar. This similarity was 
used to identify and obtain new PKS systems in both gram negative and gram positive 
bacteria. 
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PKS systems are classified into two types: type I PKSs are large, 
multifunctional enzymes, containing a separate site for each condensation or 
modification step. These represent "modular PKSs" in which the functional domains 
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encoded by the DNA sequence are usually ordered parallel to the sequence of 
reactions carried out on the growing polyketide chain. Type II PKSs are systems 
made up of individual, enzymes, in which each catalytic site is used repeatedly during, 
the biosynthetic process. 

Genetic studies on prokaryotic PKSs have focused on gram positive 
microorganisms, particularly on actinomycetes. My:xobacteria are gram negative 
bacteria that produce a large number of secondary metabolites, including polyketides. 
Myxococcus xanthus produces TA (Rosenberg, at al., 1973; Rosenberg, et al., 1984), 
which is an antibacterial antibiotic. 

The polyketide antibiotic Tel- Aviv (hereinafter TA) (Rosenberg, et al., 1973) 
is synthesized by the gram negative bacterium Myxococcus xanthus in a unique multi- 
step process incorporating a glycine molecule into the polyketide carbon chain, which 
is elongated through the condensation of 1 1 acetate molecules by a type I polyketide 
synthase (PKSs). 

The antibiotic TA was crystallized and its chemical properties were 
determined. It is a macrocyclic polyketide sjmthesized through the incorporation of 
acetate, methionine, and glycine. It inhibits cell wall synthesis by interfering with the 
polymerization of the lipid-disaccharide-pentapeptide and its ability to adhere avidly 
to tissues and inorganic surfaces makes it potentially useful in a wide range of clinical 
applications, such as treating gingivitis. 
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A growing interest in the study of PKS systems and peptide synthetase 
systems stems from the need to develop new potent biologically active compounds. 
The use of combinatorial genetics in both systems (PKS and peptide synthetase) 
separately has led to the production of new polyketides and new peptides. 

It would therefore by useful to be able to generate new biological agents from 
secondary metabolites of the antibiotic TA. 

SUMMARY OF THE INVENTION 

According to the present invention, there is provided a purified, isolated and 
cloned DNA sequence partially encoding a functional portion of a polypeptide 
component required for the synthesis of antibiotic TA. Also provided are purified, 
isolated and cloned DNA sequences encoding a polypeptide component reqtiired for 
postmodification of antibiotic TA and encoding a gene product involved in the 
regulation of the biosynthesis of antibiotic TA. A purified, isolated and cloned DNA 
sequence having a DNA sequence (Seq. ID No j4 - aaa d 2) encoding a polypeptide 
component required for encoding the TA gene cluster and any mutations thereof is 
provided. Also provided are methods of using the TA genes for combinatorial 
genetics and of using the TA genes encoding for synthesis and modification or 
regulation of antibiotic TA. 
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DESCRIPTION OF THE DRAWING 



Other advantages of the present invention will be readily appreciated as the 
same becomes better understood by reference to the following detailed description 
when considered in connection with the accompanying drawing wherein: 

Figure 1 shows the physical maps of the DNA regions involved in TA 
synthesis. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention consists of a DNA^sequence of at least 42 kb encoding 
genes involved in TA production and Myxococcus xanthus as best shown in Seq. ID 
No:l through 17 and cosmid clones containing the entire TA gene DNA sequences. 
The TA gene cluster has been purified, isolated, and cloned. The purification, 
isolation and cloning was done according to the methods described in Marshak et al, 
"Strategies for Protein Purification and Characterization. A laboratory course 
manual." CSHL Press, 1996. 

A DNA firagment of at least 42 kb (Figure 1), encoding genes involved in TA 
production in Myxococcus xanthus has been identified, cloned and analyzed. These 
steps were done in accordance with Marshak et al, "Strategies for Protein Purification 
and Characterization. A laboratory course manual.'* CSHL Press, 1996. This 
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fragment contains a large region of about 20 kb, encoding the genes responsible for 
the regulation and the post-modification of TA. An additional fragment of 
approximately 8-10 kb located 10-20 kb downstream of the post-modification region, 
encodes the enzyme responsible for the incorporation of the glycine into the 
polyketide chain. This novel polypeptide is made up of a peptide synthetase unit 
lying between two PKS modules. 

The potential of this unique polypeptide in combining the two systems can 
lead to a new family of compounds, emerging from various combinations which can 
be utilized for combinatorial genetics. Such utilization can produce, for example, new 
bioactive agents, new polyketides and new peptides. Additionally, the TA gene 
cluster can be utilized in a method for the synthesis, modification or regulation of the 
TA antibiotic. 

Mutations imparting defects into the TA gene cluster can be point mutations, 
deletions or insertions. The mutations can occur within the nucleotide sequence of the 
allele of the TA gene cluster such that the resulting amino acid sequence of the TA 
gene cluster product is altered. 

In one embodiment of the present invention, the TA gene cluster can be 
included in a vector or recombinant expression vector. This vector containing the TA 
gene cluster is able to transform a suitable eucaryotic or procaryotic host cell. A 
suitable host cell can be determined by one skilled in the art. An example of a 
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suitable cell which can be transformed by the TA gene cluster is an E. coli cell. 

In another embodiment of the present invention, the a DNA fragment encoding 
the TA gene cluster can be cloned into a cosmid, as shown in Figure 1 . This DNA 
fragment contains a large region of about 20kb, encoding the genes responsible for the 
regulation and die post-modification of TA. An additional fragment of approximately 
eight to ten kb is located 10-20 kb downstream of the post-modification region and 
encodes the enzyme responsible for the incorporation of the glycine into the 
polyketide chain. The novel polyketide chain is made up of a peptide synthetase imit 
lying between two PKS modides (See Figure 1). 

The above discussion provides a factual basis for the use of the TA gene 
cluster. The methods used with and the utility of the present invention can be shown 
by the following non-limiting examples and accompanying figure. 

EXAMPLES 

GENERAL METHODS : 
METHODS : 

General methods in molecular biology: Standard molecular biology 
techniques known ia the art and not specifically described are generally followed as in 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Springs Harbor 
Laboratory, New York (1989, 1992), and in Ausubel et aL, Current Protocols in 
Molecular Biology, John Wiley and Sons, Baltimore, Maryland (1989). Polymerase 
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chain reaction (PGR) is carried out generally as in PCR Protocols: A Guide To 
Methods And Applications, Academic Press, San- Diego, CA (1990). Reactions and 
manipulations involving other nucleic acid techniques, unless stated otherwise, are 
performed as generally described in Sambrook et al., 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, and methodology as set 
forth in United States patents 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 
5,272,057 and incorporated herein by reference. In-situ (In-cell) PCR in combination 
Mdth Flow Cytometry can be used for detection of cells containing specific DNA and 
mRNA sequences (Testoni et al, 1996, Blood 87:3822.) 

Recombinant Protein Purification 

Marshak et al, "Strategies for Protein Purification and Characterization. A 
laboratory course manual." CSHL Press, 1996. 

Example 1: 

Analysis of the TA gene cluster bv chromosomal restriction map. 

Chromosomal DNA of several transposition mutants (ER-2514, ER-1037, ER- 
1030, ER-1311, ER-7513, ER-3708, ER-4639 and ER-6199; Varon et al, 1992) was 
extracted, digested v^th restriction enzymes that cut within the transposon, and 
analyzed by Southern hybridization with six different probes (originating fiom TnK 
and TnSlac). We used probes designed to hybridize either to the entire transposon, or 
to its 5' or 3' ends. A chromosomal restriction map of the whole gene cluster was 
constructed on the basis of these results (Figure 1). The data refined the transduction 
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map (}/axon et aL, 1992) and flirther indicated that all the genes in the cluster are 
transcribed in the same direction (see Figure 1). 

Preparation of TA-specific probes 

DNA from the TnFmutant ER-4639, ER13 1 1 and ER-6199 was digested with 
Kpnl (does not restrict TnF), self-ligated and transformed into E. coli XL 1 -Blue MR 
using the transposon-derived kanamycin resistance for selection. Tranformant clones 
pPYT4639, pPYT1311/p5 and pPYT6199 carried a 1.5 kb, 2.3 kb and a 11,2 kb 
fragment, respectively (see Figure 1). 

Cloning of a M xanthus DNA region encoding genes involved in TA 
biosynthesis. 

A library of M xanthus ER-15 was constructed in the cosmid vector 
SUPERCOS-1 and screened using specific TA probes obtained from transposition 
mutants (ER-4639, ER-1311 and ER-6199, see map) that contain a TnF transposon. 
Seventy four recombinant cosmids that carried genes required for TA production were 
identified through colony hybridization. The cosmids, pPYCC64 and pPYCC44, 
which hybridized to these probes were further characterized through restriction 
analysis (see Figure 1) and sub cloned for sequencing. 

Throughout this application, various publications, including United States 
patents, are referenced by author and year and patents by number. Full citations for 
the publications are listed below. The disclosures of these publications and patents in 
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their entireties are hereby incorporated by reference into this application in order to 
more fully describe the state of the art to which this invention pertains. 

The invention has been described in an illustrative manner, and it is to be 
understood that the terminology which has been used is intended to be in the nature of 
words of description rather than of limitation. 

Obviously, many modifications and variations of the present invention are 
possible in light of the above teachings. It is, therefore, to be understood that within 
the scope of the appended claims, the invention may be practiced otherwise than as 
specifically described. 
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REGION 1: \ 

Tal * Peptidesvumetase uait-FKg aaodnie, 

FRAGMENT sizef^1:2392 

VDPARLTBAWEGLLERYPIXaGAjCR^^ 
GPLARLHLYSRSEHEHVIiLCFHHL 
LAlGGDEGRRHLI3YWRIWLATPVPPJ?t^«J^ 
LPTVIXGLYYAIXHIUrrEtoDDWVGIPTMG^ 
VIDGLEHAHYPFPRVVKDI^SNGPEEAPGFQT&^ 
PLELEWEGAKGLITJEiFK^ARLYBADTVEIUy^^ 
WNATATPFI^DLGVHELFQRQAEIETPDAMAVSYEGHSLSYQALJDTR^ 
DRSAELVAAMLGVLSAGAAYVPIX)PVHPEDRlJtYMLEDSG^^^ 
DVKAGATSAPAGTSPNGLAYVlYTSGSTGia>KGVMI^ 
lAALELLLPLCAGAQNOIASAEiyRlJAQALKR^ 
AL.PESLKAHF^^TASDVW^fMFbpTE^ 

LWIAGAGVACGYl-^^RPALTAERp^SNPFTPGT^^.YRTGDI^ 

GEIEAQLAGHPSVKNCAVVAKEMsTGTSQLVAYCQPAGTSFPEEAIILA^^ 

LSGNGKVDRGQLMAKPVVTRKKTSAVI^^ 

LVEEMNWO^DTRLAVTDLFKYVNfe^ 

PGAADPWRFWKm-REGRDSWAVTpiEEIJUi^ 

FMDPQFRI.l.LMHAWKAVEDAATTreRLGPCGVFMTASNSFYHQGSPQFPAI^ 

QAGSIPTMVSYKLGLKGPSLFraTNqSSSLSALYVAQQAlAAGDCQT^ 

NFSSAGRVKAFDAAADGMIAGEGVAVLVVKbAAAAVRDGDPIYCLV^ 

GQAEVIRRLFDRTGroPASIGYVEAHGTGTLLGDPVEVSAI.SEAFRTFTO 

AGLAGLIKTALSLRQQEN^PTLJrVTQvWKLELTO 

LEHYPRDSRPRERSQRSNAVRAVAPFS>kTLEALKDNLRAIXDF^ 
ERMVVTASTRDELVEGLRRGIAWGG^GTVVDTSPSVDADA^ 

GAEGASAJLASTX^DRRIEVVRTSSPSARUiA^^ 

LSGLGSLLRSVSQE>IPLVRGQLmVQGSVS^^ 

GDASRFWREDGVYVISGGTGAl^FYAH^ 

SSLAAFGNAGQSDYAAANGFMDGFAESRAAiVNAGQRQGRT^ 
r^n^P^'V^^^^^^^^^^^^^YGAELEfe 

SASLTAVHMACE^QRGACVMAJAGGVNLYvSteSYVSLS ^^i^J^^t^NU^wmmc 
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DNA segMenceNmucieotides 1-7178. 



GTCGACCCGOCGAGfeCTGACCCGGGCCTGGGAAGGACTGCTCGAACGGTATCCGCTGCTCGCTGGC 

GCGATTCGCGTCGAAGGCACGGAGCCGGTCATCGTCCCCAGTGGQCAGGTCTCCGCCGAGGTCCAC 

GAGGTTCCATCGGTcfccGATTCAGCACTGGTGGCGACCCTGCGCGCCTCCGCGAAGGTGCCArr^ 

ATCTCGCCTGTGGACCGCTCGCTCGGCTGCACCTGTACTCGCGGTCGGAGCACGAGCATGTCCTGCT 

GCTGTGCTTCCACCACCTGGTGCTCGATGGGGCATCCGTGGCGCCCTTGCTCGACGCCCTCCGGGAG 

CGTTACGCCGGGACCGAqGCGAAGGCGGGGCTGCTCGAGGTTCCGATCGTCGCTCCTTACCGCGCC 

gccgtggagtgggagcagctcgccattqgaogcgatgagggacggcgccacctcgactactggcgg 

CACGTGTTGGCCACGCCCGTTCCTCCQCCGTTGAATCTTCC^ 

ggctggactcggagggagcXacgcactcgcagagggtgcccaccgagcaagcattgcgactgcgcg 

AGTTCGCTCGGGCACAGCAAeTGAGCCTGCCGACCCTCCTGCTCGGGCTCTACTACGCCTTGCTTCA 

tcggcacacgcgccaggacgAcgtggtggtcggcatccccaccatggggcggcgccgggcggaact 

GGCGACGGCGATTGGGTACTTOpTCAACGTGATGGCCGTCCGCGCGCGGGC^ 

GTTCGGCTCGCTGCTGCGCCACOrrcCACGACTCGGTCATCGATGGCCTGGAGCATGCCCACTATCCC 

ttcccgcgagtggtgaaggacctccggctgtcgaatgggcccgaggaggcgcctggcttccagacg 

atgttcaccttccagagcctgcaactgacgagcgctgcgccaaggccggagcccaqgtcgggcggg 

ttgccggagcttgagccgctcgac\gcgtccatcaggaaggcgcctacccgctggagcttgaagtgg 

tggagggcgccaagggcctcacgctgcatttcaagtacgacgcgcggctgtacgaggcggacacgg 

tcgaacggatgccgcgtcagttgttqcgcgccgcggaccaggtcgcggatggggtggagtctccgc 

tgagcgcactgtcgtggctcgacgac^Vlagagcgccgcacgcttctccgcgactggaatgccacgg 

ccacgccgttcctcgaggacctgggcgttcacgagctcttccagcggcaggcccgggagaccccag 

acgccatggctgtgagctacgaggggcActcgctcagctatcaggcgctggatacgcggaqccgcg 

agattgcqgcgcacctgaagagcttcggAgtcaagcctggggcgctcgtgqgcatctacctggacc 

ggtccgcggagctggtggcggcgatgctgggtgtgctgtccgctggcgcggcctacgtacccctgg 

acgcggtgcaccccgaggaccggctgcggtWcatgctggaggacagtggcgtggtggtcgtgctgg 

CCCGTCAGGCCTCGCGGGACAAGGTCGCCGdp^TTGCCGGAGCCTCCTGCAAGGTGTGCGTG 

AGGACGTCAAGGCTGGAGCCACGTCCGCGCCGGCGGGAACCTCACCGAACGGACTTGCCTACGTCA 

TCTACACGTCCGGGAGCACGGGCCGGCCCAACkKiCGTGATGATTCCCCATCGCGGGGTGGTCAACT 

TCCTCCTGTGC::ATGCGCAGGACGCTGGGCCTGAiA,GCGCACGGATTCGCTGTT^ 

CTGCTTCGACATCGCGGCGCTCGAGCTCCTGCTTCCGCTGTGTGGGGGGGCGCAGGTCATCATC 

tcggcggagacggttcgggatgcgcaggcgttgAagcgggcgctgcgcacccatcggcccacgttg 

ATGCAGGCGACGCCCGCGACCTGGACACTGTTGTTCCAGTCTGGCTGGGAGAACGCCGAGCGGGTT 

cgaatcctctgcggtggagaagcgctgccggagtcJgctcaaggcccacttcgitcgcaccgcgagc 
gacgtgtggaacatgttcgggcccaccgagacgac&atctggtcgacgatggcgaaggtctcggcc 
tcgcgtccggtcaccattggaaagccgatcgacaacacgcaggtctacgtgctggacgaccggatg 

CAGCCGQTGCCCATCGGTGTGCCGGGCGAGCTGTGGi^^GCGGGCGCGGGCGTGGCCTGCGGTTAC 
CTCAACCGGCCGGCGCTGACCGCCGAGCGCTTCGTTTdpLATCCGTTCACGCCGGGCACGACGCTCT 

accggacgggggacctggcgcgctggcgcgctgacggts:gaggttgagtacctgggck:ggctcgacc 

ACCAGGTGAAGGTGCGCGGCTTCCGCATCGAGATGGGGfeAGATTGAAGCGCAGTTGGCCGGGCATC 
CCAGCGTGAAGAACTGTGCCGTGGTGGCCAAGGAGCTGi^CGGCACCTCGCAGCTCGTCGCCTACT 
GTCAGCCCGCGGGAACGAGCTTCGATGAGGAAGCCATCCGTGCACACCTGCGGAAGrrCGTCCCCG 
ACTACATGGTCCCCGCGCACGTCTTCGCGGTGGATGCGATTCCGGTGTCGGGCAATGGCAAGGTGGA 

ccggggccagctgatggccaggccggtggtcacccggcggAagacatccgcggtccatgcccgttc 
gcctqttgaggccaccctcgtcgagctgtgga.^gaacgtgctc:caggtcaacgaggtgggtgtcga 
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GGATCGCTTC1 

ACCCCTGGCGCTTC3TCGAAGAACCTGCGAGAGGGCAGGGACAGCGTGGTGGCGTACCGCCATGA«--rn 

aactgcgcgagctgWgtgcccgaggaggtcctccgcqattcccgttaStcSg^^ 

CATCGAAGACAAGG%GCTTCGACCCGCATTIX:TrCGGTCTGACGGCQbGGGAcS^^ 

gacccccagttccga2tgttgctgatgcacgcctggaaqgcagtggASc£g^a^^^ 

AATTTCCTGCGGACGGG&AGCCGGTCCTCCGCACCGCCQAAGAATACGTGCTGTGGGTGCTGGCrrA 

GGCAGGCTCCATCCCGAdkTGGTTTCSTACAAGCTCGGCTTGAAGGGGoS 

ACCAACTGCTCGTCATCCcVGTCCGCGCTGTACGTGGCTCAGCAGGCcJrS^ 

™T'^'^^^^'^'^'^*^°*^°T^°'='2CGTCAAGGCCTTCGACGCCGCGGCGGACaGCATGATTGCCG 

GTGAAGGTGTCGCCGTGCTGd^GGTGAAGGACGCCGCAGCGGCGGTGCGCGATScSL^^^ 

ACTGCCTCQTGCGGAAGGTGG%5ATGAACAACGACGGCCAGGACAAGGTGGGTTTATAC^ 

gcgccaccgggcaggcggagg\:atccggcgtctgttcgaccgg^«^^c™c^^^^^^ 

TTGGCTACGTCQAGGCCCATGGdACCGGAACCrrTGCTGGQTGACCCTGTCQAGG^^ 
CGAAGCCTTCCGGACCTTCACCGTcCGGCGCGGGTACTGCCGGCTGGGCTra 

gggccatctggacacagtggctgSlCtggctgggctcatcaagacggcgctgacS^^ 

^°^£I3^^'^^'=°^C°CTCCATGT%CCAGGTGAATCCGAAGCTCGAGCT^^^^^ 

gtcatcgccgaccgtttggcgccgt^bgccgtccctgccgggaccqaggcgggcggccgtS^^ 

TTCGGCCTTGGCGGGACGAATACCclcGCCATTCTCGAACACTACCCGCGCGAC^^^ 

agaggagccagcggtcgaacgcagtAcgtgcggtgqctccaitctcggcgcc^ac^^^^ 
tgaaggacaacctccgcgcgctgctcX.cttcctggaggacccggcqtSS^^^ 

CGGACATCACCTACACGTTGCAGGTCGi&:CGGGTCQCGATGCCTGAGCQGATGSTGGT^^ 

^?^S2S°^°^^°^ttggtggagggacVcggcgaggcatcgccacggtgggcg^^ 

GAACGGTGGTCGATACGTCACCCAGCGTVATGCCGATGCrrCGGGCAGTTGC^SG^GT^ 
CQGGCGACTCGATTGACTGGQATTCGCTQtACGGTGACGTGAAGCCCGCCCGTGT^^ 

gtatcagttcgcgaaggagcgctacgggAgtcgcccgcgcactccgtggSaaSot 

CCTCGTTGGCGCTCCGGCACCTQGTGGTGTTeSTGCGAGCCTCTTGATGCGCTGGGCGCTGAAGGTGC 

^3^^^^^^?"°^^°a°*^c°ctcgcggacagg%catcgaagtggtcaggac^ 

GCGGCTGGACGCGCGGTTCATGGCGCATGCCT^GGCGGTCTTCGAACGCGTCAAGGTOCTC^ 
GAGCGTCTGACCGCTCCTGTGACATTGCAGGTGicTGGTGCCAGACGAG^GS^SGSTCSS^^ 

^^I^^S^^J^^'^^^^cctgctgcgttcggtgtcgSagga 

cgcgtccagggaagcgtctccgcatcggcgctgg^ggacgttctggtgaagtccgcgcgS^ 

GACGTCACCGATTCGCGGTACCACGCGGGCCAGcfecTCGCTGTQAGTG^GcSAG^ 

ccaagggggacgcatcccgcttctggcgcgaagacWgtctatgtgattiSggaS^^ 

CCGCGCATCCTCGGCGGAQGCGGTGGACGGTGGGAACGGGCTGCGCGTGCGGCACCTTCCCC3TGQA 

tgtcacccaaccgaacgacgtgaacgcctttgtcgctXcqgtgctgcgc^ScggSSS?^^^ 

ggtgtcatccatgcggcgggcatccgccgtgacaactAcctgctcaISIa^cSS?^c^ 

^^9??ggtgctcgcgcccaaggtggtqggoctcgtca^ctcS^^^ 

CTGGATTTCTTCGTCACGTTCTCGTCCCTGGCCGCGTTTcfcAAACGCCGSc^^ 

?5^^^^^^°cttcatggacqgattcgcgcagtcccga^ggcgctcqtSacgc^^ 
g^^g^SJ^SS^Ttg"^^^^^ 

GTGAACTCTCCGTGAGTGTTTCGCCCGCACCGCCTCCGCATcVGGTGGCQTTGaACGCCrx-rc-wTxTrTTr 

CCGCATCGATGCCCGCCAGCCGATGGAGCGCTATGGCATCG/^TCCATCATCATC^CGC-ArvATrAAr^ 
CAAGCCCTCGAAGGGCCGTACAACGCCCTCTCGAAGACGC?G5fcT?C^^ 

AAGTCAGCGGGTATCTGGCCGAGCACCGCGCGGAAGAGAGCGdGAAGTGQGTScG^S 

^^^f^^ir^^T'=^CCGTCATCCAGGAGGCCAGGCCGCCACGTGcS^^ 

^^^2S^°;^5=QAGCCCATCGCCGTCATTGGCATGAGCGGCCGTTA?tCCG^ 

CATGTGCTCGGCCTCGCTGACAGCCGTCCACATGGCTTGCGAGGCGCTGtAACGAGGCGCCTGCGTC 
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ATGGCCA^tGCGGGTGGAGTGAATCTCTACC-TCCACCCGTCGAGCTACGTCAGCCT^ 
AGATGCTG*^CGAC 



REGION 2 

TaRl - Sarface laVer protein 

^lKVVNKLLEKLI>DVVAWvI»l^V^a.QDQDIKV^ 
LGDVDENGnGE^^SPESA&GPIO'QIXLGSDIGWMRYQVSAIlVKj^ 

hplgqkmbeavrsdlshSrlmqato 

BI.LALKTSAI<AGLSARVstoDYQLSFSRPRAGRlQVA 
QLI^X.LGPVLIU:>LVKKGTtrAV^ 

ADFKARVAESLENAVTlTOvVEGFEYEYI.RLSETSTLIJ3VVVmDV^ 
AQOSEFELRNYLHATTLmoW 

lgtrgqwvvdlkadmtWs]^^ 

ATVISTMQEDMGKHPIETJRl^IiJ^^ 
APIWEAYIJiEVQEQGSIJva.^^DlsPSRAAQIA^ 

KKPNTLMRCRHFVSGMVRL^^ 

VERXLTVRVADSEEQLVFSTAR5TGAA j 



TaM2 - two componcMt svstp r^ , WToomse regulator 
From Micleotide ? ^ i ^ ii-^k 47^-7 ^ siAfaa^: S29, 
MPSGCYGAASAPVtPPXJPAMPQAPSD^^Q^^-^^'I'^^LV^ 

dparrelfsbaahlpevsqirvklgqgvagtvakaghai^ 

3UDGDGALYGVLQVI.NKRGEDRFroEi:)TQRLTAI^^ 
ESPQLQAIY1G.V^IKAAPTDATVL1^GESGS^^ 
ERGAFTGADffilVPGKFEAASGGTWlDKGELJPI-PVQGKLLR^ 
DLARMVaEGRFREDLYYRIKWEWI^PL%RGAEDIER^ 
yRWPGNVIU£I^CffiSAVVl.CEeElLE3EHLPLPDV^ 
iam.KVI.DAVKG>rRTAAARVLAJGF>nT^^ 

TaiR3 ^ two compoiaent syatet^i Bdmase semtkor. 
From BiTutcleotide 5595 to 47 ? ft «=^'™(aa^: 292^ 

mrasqaeaphsrrltmevrphgvrgsiavsgsrig^^ 
apqeatxffshlhwdrvqoepfftpawlptsex-tly^ 
^<mfrsalharpvevgpfrvttidvphpqgcl.ayrl^^ 
cldaqytpdeyegrkgvakkgwghstm^^daagva^ 

lppvceparecqrlvlgraa 



TaA ■ NUS-G liks tran vg^^' pt^QflB a Tititermiaatior- 
grom nucleotide 6290 to 6793. , ^ige^aaV' 16§ 

MPQPRCAENDWVALLVRV>rEIEKV.A^'^AQX-Gm 
YRIVRAPGVIRLLGGDaGPEAVPAQELE-4IRRVADSGVSS^" 

QLRFIVSVGLLQRSVSVEVSAEQLEPrrD 
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DYLRVGQRVRHEGPLTGLEGSLVTSKS 
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TaB - acvl Varrier protein f ACP\ 

From nucleotide 6870 to 7106, sizfi(aal: 79 

^IDKIUIFDIVTSsWEVVPELESHPFEPEDDLVGLGANSIJDRAEr^^ 
DVLHARL 



Q 
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TaC - beta-ketnact^ TACP] avmthase m (KAS IH. FabEO 
From mmcieotide 7119 to 8378, sizefaa'^; 420 

MGPVGIEAMNAYCGL^iaLDVLQI^THRGLDTSiarANIXMEEKTVPLPYm 
ERDSIELLVACTESSFDFGkAMSTYUaQHLGLSIU^CIU-IEl^ACYSGVAGLQMAV^ 
WASDLSRFSIAEGGDAST^DWSFAEPSSGAGAVAMLVSDTTRVFRVIJVGANGYYGYEVMDTCRPVAD 
SEAGDADLSlXSVlX)CCEN^FREYTRRVPAA>ryAESFGYLAFHTPFGGMVKGAH3lT!V^^ 

lEADFQRRVAPQLTYCQRV^^GATMALSLLQTIDHGDFATASIUGCFSYGSGCSSEFFSGVVTEEGOO 
RQR^^LGEALGRE.QQLS>4P^YDAIXKGNGLVI«fGTllNAEIJDFGWGSrR^ 

TaD - memljraae associated pgntigm 
From MMcleotide 8404 to 9378. skeefaak 325 

MSSVATAWLTARDSAVSRROtriPSI^GQTSlJ^AGQIGDWAWDTVSRLCGTDVLTATO 

YYFIURGTPAI^GALRFGDTIJDVTSKAYKFGSESVLTVHMCKTAEGGAPEADAF 

YAETF>IRWm^SDGKSNESLIKSSPVGF{5YAHIJ»IXFDEYSPRRAYGDARARGT^^ 

plryavbvirdvngvgliyfasyfsmvdVaiwqlarhqgrseqaflsrvvij^ 

DVQHWERVGGGEELF^m<MREGAQGIUa^VATVKV^ 



TaJE ■ acvl carrier protcSa f ACP'^n 
From saucleotide 9336 to 9364> sige(aa^: 




MTDEQJRGVVHQSIWVLPRVRSNEIAGHL^n:.! 
VaFLAGEVAGG 



.GADSVpRVEILTSILJiDSLRX-QKIPI^KFADIRNm^ 



, FabE) 



TaF - beta-ketoacvl TACPT aviatfaase IH (KAs) 
From arocleotide 9757 to 1087S. sizefaa^: 374 

mmqergvalpfedpvtnavnaarpildamspeam:3ubllvtssesgvdfsk^ 

EVKQACYAATGALQLALCYIASGVSPGAKALVIATDV^VDESGLYSEPAMGTGGVAV^ 

MDLGAFGNYSYDVFDTARPSPEmiGDVDRSLFTYl-DCLmSFAAYGi^ 

GLVTCAGHRKM3VK£L1PCDVDEIEaDFGRRVKPSLQYPS£^^ 

sygsgcsseffsgvtgpesvsal^gldigghlrqrrqi-tfdqyvellkenlrclvpt:^^ 

VTKTASRPKMLAIJLRVVDYHRQYEWV 



TaG ■ signal ueotidase H QLSPA^ 

From nucleotide 10909 to 11421. sizefaa^: 171 
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MNTPSLTNWARLGYLLAVGGAWF^ 

FGEEWllMKTYVVGAICIVLLIGYYFYTPF^^ 

DRPV^SFOTADTAVWGAALMILZ3?SS?R.QQL3?G 

TaH - cvtocWom e ^450 livdro^wlage (cF45(^^ 

From nucleotide 11473 to 12897. sizeraa^: 475 

MGTSEPVEPDHALSV^P'VAPVGAQALPR 
HQTSDPAUEVIFKG^DI^GGKANNGLKPWGEN^ 

rdivnaaldrwpvgkWvheetqqimlevilrvifo]^^^ 

A£GFAKAVGKAFPSLD%SLKAIDD^^ 
MTLI^MAGHETSAIIAA^^^ 

CIGTSFAYYEM<IFVSETVRk^^DXRPGYHAKVV^ ^ 



m 



T^I - m aloavl CoA IFACPT trAnsacvlase (TVtCT. FabB) 
From nucleotide 12938 to 1389V sizefaa^: 318 

fFMAKEL 



MVDSVSKQARRKVFLFSGQGTQSYI 



PRGGMLAVLHELEACRGFTSVAIUDCE\Wi;nrP 

DVGPAASFATHKBILARDSTSRLPPLLSPSP'iJsTGSSMG VJi- liJ i vi,QL,iCQGGCDFl 



TaJ - maionvl CoA TACPT traaasacylase (MCT. 




FroaiB nucleotide 13909 to 14898. sizefaa'ti 330 

fttfreAPAPKAPAQVPPPPSSPWALHTRGAASAPWABkAAL^PGQGS 
LGYSIKRLCLEDPGKELAQTQFTQPALYVVNALST^^REGAEQP^^ 
TGI^VXKRGELMSGASGGTft^WGCDAVAVEQVrS^ 

SPVQWTATVRYLLEQGVEDFEELGPGRVLTRLrrANKRGV^ATAAPAKwSA ^^^^^ 

TaK - 3-o3:oacYl rA CPI synthase fKAS I., FabB) 
From apcleotide 14963 to 1^213. sizefaat; 417 

5^!^^yS^^^®°^'^'^'^^^*^Q°^SFTSAlXEGAASFRVMERP^QHQANGQTrAHLG^ 
AAYGSTFMDTDLVGLCTQQFAmGMSFTVQGASASGLI^VlQAAEXVL^irVDV 

cqglramgamgtdrfarjepebacrpfdresdgfifgeacgaVvveS^^ 

DASRGPLSSIERESQVIGAAlJlHADL>JERVDYV>ffHGSGSRQGDArS^^^ 
^J^g^°^^°LlAlXVQl^QGIU,j.:,SL>rx.VT:.piDSSFRWGAT 

Tali - enoyl CoA hvdratase- \ 



From nucleotide 16224 to 17009. sizefaals 262 



MQAASPPHRDYQTLRVRFEAQTCPLQLH3lPDAD>mSRTLIDECQQVL^ 
GADFRAIHBRVDDGIGUSQGNAEQLYia-WLQLATGPYVTV^AI^ 
FSLSELLFGLFPACV>4PFLARRIGIQRAHYI-TLMTRPIDAAQALSWGLAr^ 
KP A VTQYKKYASELGGQLIAAAdPRAISA>iEAMFSDIUTI-^ 

\ 



2290.000^5 

TaM - enovACoA hvdratase. 

From nucgeotide\l7000 to ]7767> aizefaal: 256 

MGIMTEGTPMAPY/TIJIEVEEGVAQITLVDRE^^^ 
YFALGGTKAClJLShzDGIGSFSrVTNFYSLALECV^ 
F^^YGFTPGMGATYIVPKRLGYSLGI^^ 
PKI^SLVTLKRHLVimnUlELPDVIERET-Kl^ 



TaN " O-methvitraiisferase (fragment^ 
From aanjacleotide 17782 Vo 19053. sizefaa'): 423 

MLNLIbTNHAHGYvVTPVVLAC^ 

addvyavtaaaaah3lsfprsaqslfalp^mrylrgedglsi^ 

plmi-aleqrgglkearrlsd]\wsggdgmtcvpeavqhelagffsaqkt^^ 

kallfaivgsykpmlasmpqiAfgdcdq^^ 

aenlsaqpryladmgcgdgtlakrvyetvlrhi^ 

alradvarpdrliedlrarglaepentijeilrsfldm 

evfhslvehle \ 



DNA seggence 1-19053 \ 

gtcgacgttgacgtcgcccggtggccto-gccgtgtgtcttcttcgacgcggaggtgcgcgaggtggcg 

gcggacggccggcgcgggccgctgttgtggcgtgagcgcgcgtatgcgccggtactggcgctgcgt 

ggccagcgcctccatgcttcggtgtccttttcgcccgcgtcgctgatggctccggtggagg 

ggtgcaaggccctgccaggcacggtgcccgcgtcctggtatcagacggcgcacccggaggccctgt 

cctgggagcgcgtgggcgcggtgggcgXvtcctgcctcgtggtgggtgaactccggaggggccctg 

tcgagggcagctacgccctggtcggtcgggagggcggccccgcgatgttggtgctgggaccccagg 

ctccggccacctgtgggacgctggcgcgccgggcetggcggcacttcggggcggccgggc3tgctgt 

ccatggccgcggccgtcgtcctgtcaggggcgctgtgagacgcgcggcgggggccgtaccgccgcg 

ccagaaacgtgatgcgccgccaggcctcgcggtccgggcactgacgcccgggccgctcgggactcg 

ctcaggcqgctccggtgcttcgcgcggtggagaacacgagctgttcctcgctgtcc^ 

ggtgagggtccgctccacgccggcgaggcccWgcggcgtggactgcgccaggtccgagagcaggga 

gcccgcagcgcgcaggtggaagccggtggtcgacatgccctccagctcgccgaacacggtgcgcag 

ctcgtccggggccttgcgttcgtcgatggcgccmcgcaggcgcaccatgccgctcacgaagttcctg 

caccgcatgagcgtgttggggttcttggaqatcaacctccgcgaagctgaagttgccg^ 

gccactcgctttcgatgagctgcaggtccttgcqaaggtcgcgcaccgtggggtcct^^ 

ccacttggcgatctgcgcggcgcggctgggtgacaagtcattcagcl^vtgag 

acctcgcggaggtaggcctcccagatgggggggmgaccgcgcgccggaactcggcggaggcgcgg 

ggaagctgctcgctccagggcagcgcgcgggccaffiggcgcgtgagaagcgggacagctcgagcgtc 

tggatgcggggcaccagggcgcggaacgagtcatgcgccatcttcagctcgagccgcgttt^ 

gggtgcttgcccatgtcctcctgcatggtgctgatgacggtggccgcgtccttcgcgtccagcacgc 

cccagacgacggcgtcatccaccgcctgcjrgcacgtccttccgcgacagcttcttctgccgtcccca 

cagcatcaggtgcaggccgtagccgaagtcggaggccacggggqtgggagagaagcgcgtcatgtc 

cgccttcaggtccaccacccactggccgcgggtgcccagcagcttgtcctcgtagccccggcgtccg 

AGGAACGCCATGCGCCGGGCGGCCTGGAaGTTCTCCTGCGTCACCCAGGACTGCTTGCTGACGTTCT 
TCGCCTTGAGCAGCTCGAACGaGCCCAGCCCCAGTGaQAAGCCGATGGCCTGCTGGCGCGTGAGCG 
TGGTGGCGTGCaGGTAGTTGCGCAGCTCGAACTCGCTCTGCTGGGCGGGGAGGCTCTTCATCCACTT 

cagcagctccaccaggttgcccttgagcagggactcgtggaagcgcatcgcggtgacgtcctccacg 

acgacctccagcagcgtggaggtctccga'caggcccaggtattcgtactcgaagccctcggcgacct 

gcgtgcggacggcgttctccagcgactctgcgacgcgggccttgaagtcggcccaggcctgcggaa 

ggttggccgggtccgcaagctgcgggtcgatgccaaggcg&tccagcaccaggcccagcaccttctt 

ctgattgtcgtccagcttcgccttgctggccttgtccaccagcccgtccatgatttccaccccggtgg 

tgcccttcttgacgaggtcgcgaaggacgggccccagcagci^ttccagcaactggcccagttggg- 

ccttcaccgtcgccgggtccagcagctccacggtgatgcccaffiggcggcggagagggcctgc 

ggacttcaccttccgcacggcgacctggatgcggcgggcacgffiggacgggagaagctgagctggta 

gtcgtcggtgagggacacccgggcggacaggcccgccttggcfcctggtcttcaacgcgagcagctc 
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gttgccgcgc\ggaagcccaggcggttgaggttggtgggga^ 

ccgtgtgtggatecgcgccgcggacatgccacgccaccgcgtcccccgtggtcagcttggccaggtcg 

gtggcctgcatcVgccgcagctcggacaggtcggagcgcacggcctcacgcatgttctggcccagc 

ggatgcgcgcggi;agtcgctgagcgtgacggac.\gctccgtctggttctcggaggcgaggaaggac 

aggctggcgctcaoggcggccttcacgcgcgcggacacctggtagcgcatccacccgatgtcactgc 

ccagcagcagttggiigccggggccctggctcggcgctctccgggctcggctc 

ttcgtccacgtcgccc^gcgagttgaagttccggatgqacgcttcgccggtggcttcgaaggagag 

ggtgaagccgtgcatggcgagcttgggcggaaggattttctcttccgtgaaggtcccctgggccagc 

ggcaccttgatgtcctgstcctgc.\gcttcacgtcgggcaccttgcccgccacgacgtcgggaagct 

tctccagcagcttgttgaJscactttcatgcgcgtccccctgggctgaagcctcctgcacgtgggccg 

gaggtctcttcgtcgtacgfecgttgcccagctcggaacaaggcggataccagaaaagaccggtggt 

CAGCGGACAGATGCCCTGGAGGGTGGGGTGGGAGCCGCCCCCGCGCGGTGCGTCAGGGCTCGTCGC 

CCAATCCGTaCTCCTTGAGTBTCCGCGCGAGCGTGTTGCGGCCAATCGCCAGCACGCGGGCCGCGG^ 

GGTGCGGTTGCCCTTCACGGCGTCCAGCACGCGCAGGATGTGGCGGCGTTCGACCTCCGCGAGTGGC 

AGCAGGCCCGCATCCAGGGGCffiCAGGCGCAGTCGGCGCGTTGACACCCTGAGCGGCTGCGGGAGGC 

GGCAGGGCGGCCCGGTCCACA'TCGGGCAGGGGCAGGTGCTCTTCGAGAATCTCCCCTTCACAGaGC 

accacggcgctctcgatacagttctccagctcccgcacgtttccgggccagcggta 

gctccaccgcggcggcgctgaggfcggggcggcgtcagccggtgcctccgggcgacggcggcgacga 

agtggcgggcgagccgctcgatgtcctccgcgccgcgctcccgcagcggcggcagcaccacctcga 

ccaccttgatgcggtagtagaggtoctcgcggaagcggccctcggccaccatgcgggccaggtccc 

gatgggtggccgcgacgatgcgcacgtccaccttcacggcctgggtgcctcccacgcgctcgaactc 

gcgatcctggatgacccgcagcaactrgccctgcaccggcaggggcagctcqccaatctcgtcgatg 

aacacggtgccgccgctggccgcttcaaacttgccgggc;alcgcg 

ccgcgttcgtggccgaa.gagctcgttctcgatgagcgtggcqggcagcgccgcgcagtccaccttga 

tgaagggctggtccctgcggggaccatttv^cgtggacggcacgggcgaacagctcct^ 

cactctcgccgcgcagcagcaccgtcgcatcggtgggcgcggccttgcgcaccagtcggtagat 

cctggagctgcggggactcgccgatgatc^ggttgaagaagtagcccaccggtacctggggctgct 

ccttcgcgcgctggagctcttgatagaggotggtgctctggagggcggtgctcacctgcqagecgat 

ggcggtgagccgctgcgtgtcctcgtcggtgaagcggtcctcgccgcggcggttgaggacctggag, 

cacgccgtagagggccccgtccccgtcgcg<^qtggcacggcgagcaggctggtggtgcgcjtagec 

cgtcatccggtcgatgtccgcgaagaagcgctgctcgccgcgcgggtccggcacgttgatggcgtg 

ccccgccttggcgacggtgccggcgacgcccitggcccagcttgacgcgaatctgggacacctcgggc 

aggtgcgcggcgcggctgaacagctcgcggcgggccgggtccaccagccagagcgtgccgeggtcc 

gcttgcagggtgatggcgatgcggtccatcagcIetctggaggaacgcgtcgaggtccacctccctg 

ccgacgagtcctccgaaggqgaggaggagctggwgacgtccgagggggcttggggcatgkicg^ 

caacggcggcaggacgaaggcggaggccgcaccAtaacatccagagggcatgggactgccccctct 

caggccgcgcggcccagcaccagccgctggccttcgcgtgcgggctcgcacacggggaagagggcg 

cgggcctgctccgccatgtcctcgagcatgtcgtcgccgtgcgccgggtcatggtggaacaggcaca 

gccggcgcgcccccaccagcccggccacgcccgcggcatccatcatggtggagtggccccagccctt 

cttcggcacgcccttgcggccctcgtatrcgtccggcatgtactc3cgcatccaggca 

gcgccctcgaagaggcggcccacctccggcgcgagctcctgcacccgcacctccacgtccgtggcgt 

agacgaacgaatggcgatccgcctccaggcggtacgcaaggcacccctgcgggtgcggcacgtcga 

tgggcgtgacgcggaaggggcccacctccacgggtcgggcatgcaacgccgagcggaagtccatcc 

gcgagcgcatggtgctcagcggcaccggaaaatgaagcbgctgcatctgcgcggccaactcggact 

ggagcgcctgggccccattcgcgcccggaccgtagagcgVcagctcggacgtgggcagccaggccg 

gcgtgaagaaggggaagccctgcacgtggtcccaatgcacatgcgagaagaagagcgtggcctcct 

ggggcgcgccctcgcgcatcatgatttcgcccagtgcgcgffiatgcccgtccccgcatccaggatgag 

gcggtggccctggctggtcacctccacgcaggccgtgttgocaccaatgcgcgagcccgacaccgcg 

atgctcccccgaacgccatgaaaccggacttccatcgtaagtctccttgaatggggggcctccgcct 

gggacgccctcatggccggagcctcagagcacggggtgtgccattcccaaatgcccggaatgv^gga 

GCGCGGGCCTCGGGCTCGTCCACCGGTGCTCCaGAATGGaTCGCGCTCGCCTGGTGCGGGCGATCCA 
AAGCGGTGCAGGTCGCCCGCAGGACGGGGCGGCGGGCACGTCmrCCAACGTCCCACGGCAGTC^^ 
TCTTCAGATCTCTCCCGATGCGGGAAGGCGTCCAGGAGGTTGCkcCCGGCATCGAGCGGGGCTGTGT 
GTTTCAAGTCTTGTCGGAGCCTCGGACACAACCGTCTGGGTTCTCGGAATGCGCCGGCTTCCGT^ 

ctccagagtqattcaatggctctcgagtgcaggtttagcaatcc'tcgggccgtaac 
aggcagtcacgctctcgtcacgcttggggtgtttccagcttcaacsgtgtttatccttcag^ 

TTGCTTGACACGCTGCCTCATCGAAGCGTATGCAAAACAATGAAAA-CGGTGTCGTTGCCGAGCCTTa 

GGGCCTCCAGAACGCCATCCTCGCGGACCCAGGCAGCCGGAATTT(^GACQGGGCTGTCaGCGGTT 

TG.^^CGCAAGGATGCGGCGGGGGTTGTGGCGGCAGCCCGACCAGAATTCGGTrGGtGTGCCAGTTA 

TTGTa^GATTCTGAGAAATAGCAGGCTGGGGGGAAGTTGCAATGCC'KjGGCCGCGGTGTGCTGAGA 

ACGATTGGGTTGCATTGCTCGTCCGCCTCAATCACGAGAAAGTGGCT<k:CGCTCAGTTGGG^ 

CGGCTACGAGTrCTTCCTGCCGACGTACACGCCTCCCAAGTCCTCGGGTC 

CTCTTCCCCGGGTACCl 1 1 iCTGTCGTTACCAGCCGCTCAATCCGTACcdcATCGTCCGGGCGCCCGG 
GGTCATCCGGCTGCTCGGAGGTGACGCGGGGCCGGAAGCCGTGCCCGCACAGGAATTGGAGGCCAT 
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CCCCCGGGTCGCGoA^CQGQTGTCTCTrCCAATCCCrGTQACTATCTGCCGGTGGCGCAGCCCOrQ 
CGCATCATCGAAGGGCCCCTGACAGGTCTGeAAGGA.'^GTCTGGTGACGAGCAAGAGCCAACTCCGG 
TTCATTGTCTCCGTGGaGCTGCTACAGCGCTCC-GTGTCCGTGGAGGTGAGCGCCGAGCAACTGGA^ 

cgatcaccgactgattccgcggacatccccttccattccttcatcaccccgacccgcagcaaggcttc 

agggaccgtgagtcgttccatggacaagagaattattrtcgacatcgtcaccagcagtgt^ 

gtggtacccgaa.ctcgaa(rcacatcggttcgagccggaggatgacctggtcggactgggcgcgaac 

tcgctcgaccgcgccgaaAtcgtcaacctcacgctggagaagctggcgctcaacatcccccgggtcg 

agctgattgacgcgaagaccattggcgggctggtggacgtccttcacgcgaggctgtgaggcgaag 

ccatggggccggtcgggattcaagccatgaatgcctactgtggcatcgccaggttggatgtgttgca 

gctggcgacccaccgtggcctggacacctcccgcttcgcgaacctgctcatggaggagaagaccgtc 

CCGCTCCCCTATGAGGACCCTOTCACCTACGGCGTGAATGCCGCCCGGCCCATCCTGGACCAGTTGA 

CCGCGGCGGAACGGGACAGCATCGAGCTGCTGGTGGCTTGCACGGAGTCCTCGTTCGaC 

AGGCCATGAGCACCTACCTGCAaCAGCACCTGGGGCTGAGCCGCAACTGCCGGCTCATCGAGCTCA 

AGAGCGCCTGCTACTCCGGGGTCfecCGGGCTGCAGATGGCCGTCAACTTCATCCTGTCCGGCGTGTC 

GCCGGGGGCCAAGGCCCTGGTGGTGGCCTCCGACCTGTCGCGCTTCTCCATCGCCGAAGGGGGAGA 

TGCCTCCACGGAGGACTGGTCCTTqGCGGAGCCGAGCTCGGGTGCGGGCGCGGTGGCCATGCTGGT 

GAGCGACACGCCCCGGGXGTTCCGOGTCGACGTGGGGGCGAACGGCTACTACGGCTACGaGGTGAT 

GGATACCTGCCGCCCGGTGGCGGACAGCGAAGCGGGAGACGCGGaCCTGTCGCTCCTCTCCTACCT 

ggactgctgtgagaacgccttccgggagtacacccgccgcgtccccgcggcgaactacgcggagag 
cttcggctacctcgccttccacacgccgtttggcggcatggtgaagggcgcccaccgcacgatgatg 
cgcaagttctccggcaagaaccgcggd«3agatcgaagcggacttgcagcggcgagtggcccccggg 
ctgacctactgccagcgcgtggggaackxcatgggcgcgacgatggcgctctcgctcctcgggacc 

ATCGACCACGGCGACTTCGCCACCGCGAAGCGGATTGGCTGfCTTCTCGTATGGCTCGGGGTGCAGCT 

CGGAGTTCTTCAGCGGCGTGGTGACGGAfeGAAGQGCAGCAGCGGCAGCGCGCCCTGGGGCTGOGA 

GAAGCGCTCGGGCGCCGGCAGCAGCTCrfccATGCCGGATTACGACGCGCTGCTGAAGGGGAACGGC 

CTGGTGCGCTTCGGGACCCGGAACGCCG>^CCTGGATTTCGGTGTCGTCQGCAGCATCCGGCCGGGC 

GGGTGGGGCAGGCCCTTGCTCTTCTTGTCGGCGATTCGTGACTTCCATCGCGACTACCAATGGATTT 

CCTAGCCTCGGGGCTTCGAGCaAAGCCATOTCCAGCGTAGCGACGGCCGTCCCCCTGACGGCCCGTG 

ACAGCGCGGTGAGCCGCCGGCTGCGAATCACCCCCAGCATGTGCGGCCAGaCGTCCTTGTTCGCCGO 

gcagattggcgactgggcatgggacaccgtffiagccgcctgtgtggcacggacgtgctgaccgcgac 

caacgcctcaggcgcgcgciacctacctggccttctattacttccgcatccggggcaggcccgcgctg 

catcccggcgcgctgcgcttcggcgacacgcttggacgtcacgtcgaaggcgtacaacttcggcagc 

gaatccgtcctgacggtgcaccgcatctgcaAjAcggcggagggcggcggtccggaggcggatgcc 

ttcggccatgaagagctgtacgagcagccccagccaggccgcatctacgcggagaccttcaaccgg: 

tggatcacccgctcggacggcaagtcgaacgagagcctgatcaagtcctcgcccgtggggttccag 

tacgcacacctgccgctcttgccqgacgaatactcgccgcggcgggcctatggggacgcgcgggcg 

cggggcacctttcacgatgtggactccgcggagtaccggctgaccgtggaccgcttcccgctgcgct 

acgcggtggacgtcatccgggacgtcaatggggtggggctcatctacttcgcgtcgtatttctcgat 

ggtggactgggccatctggcagctggcgaggcaacagggacgcagcgagcaggccttcctgtcgcg 

cgtggtgctggaccagcaactgtgcttcctcggcAacgcggcgctggacaccaccttcgacatcgac 

gtgcagcactgggagcggqtgggcggcggggaa<aagctgttcaacgtgaagatgcgcgaqggcgc 

gcagggccgggacatcgccgtggcgacggtcaagatgcgcttcgacgccgcttcggaaggaggccg 

ccgtgggtgagccgatgacagacgaacaaatccgcggagtcgtgcaccagtcgatcgtgcgcgtcc 

tgccccgcgtgcgctccaacgagattgcgggccacttgaacctccgcgagctgggcgcggactccgt 

GGACCGGGTCGAGaTTCTCACGTCCaTCCTGGACAGCCTGCGGCTGCAGAAGaCGCCACTGGCGAA 
GTTCGCCGACATCCGCAACATCGACGCGCTGGTGGcdxTCCTGGCCGGTGAGGTCGCGGGTGGCTG 
AGCGGGTTCCdGGCGGAGTCGGCATCGAGGCCATCAji^GCCTACGGCGGCGCCGCCTCCATTCCGG 

tgttggacttgttccggggccggcggctggaccccgaagcgattctccaacctgatgat^ 

gcggcgtcgcgctgccgttcgaggaccccgtcaccaacbcggtcaatgcggcgcggcccatcctgg 

acgcgatgtcgcccgaggcccggqagcgcatcgagctcctggtcacctcgagcgagtccggcgtgg 

acttcagcaagtccatctcctcgtatgcgcacgagcacctggggctgagccgccactgccggttcct 

ggagotgaagcaggcctgttacgccgccaccggagcgciccagctagcgctgg^ 

gggcgtgtcaccgggggccaaggccctggtgattgccacffigacgtgacgctggtggacgagaqcqg 

tctgtactccgagccggcgatgggcaccggcggcgtcgcogtgctgctggqcgacqagccgcgcgt 

gatgaagatggacctgggagcgttcggcaactacagctacgacgtcttcgacaccgcgcggccctc 

gccggagattgatatcggcgacgtggaccggtcgctcttc/icgtacctggactgcctcaagcacagc 

ttcgccgcgtatggccgccgggtggacggtgtcgacttcgtgtcgacqttcgactacctggcgatgc 

acacgccgttcgccggactggtgaaggccgggcaccjjcaagktgatgcgcqacctcaccccgtgcg 

acgtggacgaaatcgaagcggacttcggccggcgcqtgaagccgtcactgcagtacccgagtctgg 

tcgggaacctgtgctccggctccgtgtacctgagcctgtgcac^atcatcgacaccatcaagcccaa 

gcggtccgctcgggtgggaatgttctcctatgggtcgggttgdrcgtcggagttcttcagcggcgte 

atcggcccggagtccgtgtccgcgctacctgggttggacatcggtggccacctccgggggcgccgc 

cagctcacgttcgaccaatatgtcgaattgctgaaagagaa.ccttcgctgtctggttccaacgi^ 

accgggacgtggacgtggagcgctacctcccgctggtgacgcgcacggcgagccgcccgcgcatgc 
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\GCGGCCGGGCCGfCAGCATCACGCCAACGGGCAGACGACGGCCCACCTGGGGGCGGAAATCCCCT 
CGCTGGCCGTGCCCC^AGGCGTCACCCCACAACTGTGGCGCTCGGCCACGTTTTCGGGGCAGGCCGC 
ACTGCTGACCGTCCACGAGGCCTGGAACGCGGCGCGCCTCCAGGCCGTCCCCGGACACCCGATTGG 
ATTGGTGGTGGGGGGCACCAACGTGCAGCAGCGCGACCTGGTGCTGATGCAAGACGCCTATCGCGa 

gcgggtgccctrrctgco^gcggcctacgggtcgaccttcatggacaccgacctcgtgggcctctgc 

acgcagcagttcgccatccacgggatgtccttcacggtgggaggcgcatcggccagtggcctgctg 

gcggtcatccaggccgcgqaggcggtgctctcaagaagggtggacgtttgcatcgccgtgggggcg 

ctgatggacgtctcctactgggaatgccagggcctgcgggccatgggggcgatgggcaccgaccgg 

ttcgcgcgggagccggagcotgcctgccggcccttcgaccgggagagtgatggcttcatct^ 

aggcgtgcggcgccgtggtggcttgagtctgcggagcacgctcggcgacgcggggtgactcctcgcg 

gcatcctgtcgggctgggccatgcagttggacgcgagccgcggcccgttgtcgtccatcgaagggg 

^gtcgcaggtgattggggctgcgctgcggcacgcggacctcgcgccggagcgggtggactacgtga 

atcctcacggcagcggttcgcgtcagggggatgccatcgagctggggeccttgaa 

tgacgcacgcccgggtcaacaccXcgaagtccatcaccgggcatggcctgtcctcgqcgggtgccgt 

ggggctcatcgccacgctqgtccagttggagcagggccggctgcacccgtccttgaacctggtggac 

ccgattgattcatcgttccgctgggygggggccaccgcggaggcccagtccctccagaacgcgctgc 

tgctcgcctacggcttcggcggcat&aacaccgctgtcgccgtgcgccgqagcgccacggagagct 

GACACGCCCATGCAAGCCGCTTCCCdTCCGCACCGCGACTACCAGACGCTCCGGGTCCGCTTCGA 

CGCAGACCTGTtTTCTCCAGCTCCACqpGCCGGATGCGGACAACACCATCAG^ 

CGAGTGCCAGCAGGTGCTCACGTTAXqTGAGGAGCACGCCACCACGG 

acacgtgttctgcatgggcgcggattttcgagccatccacgaccgggtcgacgacggccgccggga 

GCAAGGCAACGCG<L\GCAGCTGTACCdpCTGTGGCTGCAACTGGCGACAGGCCCCTACGTGACGGT 
CGCCCATGTGCAGGGCAAGGCCAACGCffiGGCGGCCTGCGCTTCGTCGCCGCGTGCGACATCGTGCT 
GGCAAAGGCGGAGGTCCAGTrCAGTCTdrCCGAGCTGCTGTTCGGGCTGTTCCCCGCCTGCGTGATG 

ccgttcctcgcccggcgaatcggcatccagcgggcgcactacctgacgctgatgacgcggcccatcg 

ACGCGGCCCAGGCGCTGAGCTGGGGGTTcfccGGACGCGGTGGACGCCGATAGCGAGAAGCTGTTGC 

ggctccacttgcgcaggctgcggtgcctg-kcgaagccagcggtgacccagtaca^ 
ccgagctgggcggccagctgctcgcggcgiAtgccccgggccatctccgccaatgaggcgat^ 

CGACCGCGCCACGCTGGAiiGCCATCCATCGCTACGTGGAGACAGGCCGACTCCCATGGGAATCATG 

ACGGAAGQAACGCCAATGGCGCCGGTGGTckcGCTCCATGAGGTGGAGGAGG<3GGTGGCGCAGAT 

CACCCTGGTGGATCGCGAGAACAAGAACATCiTTCAGCGAGCAGCTCGTGCGCGAGCTCATCACCGT 

GTTCGGCAAGGTGAATGGAAACGAGCGCTAdcGCGCGGTGGTGCTCACCGGCTACGAGACCTACTT 

CGCGCTCGGCGGGACCAAGGCCGGCCTGCTG-TCCATCTGGGACGGCATCGGCTCCTTCAACGTCACC 

AACTTCTACAGCCTCGCGGTGGAGTGCGACATOCCGGTGATTTCCGCCATGCAGGGACATG^ 

GCGGCGGGTTCGCQATGGGGCTGTTCGCGGACirrCGTGGTCCTGAGCCGGGAGAGCGTCTACACGA 

cgaacttcatgcgctacggcttcacgccggggaVgggcgccacgtacatcgtgccgaagcggctgg 

GGTACTCGCTCGGGCATGAGCTCCTGCTCAACGCCAGGAACTACCGCGGCGCCGACCTGGAGAAGC 

GGGGCGTGCCTTTTCCGGTGTTGCCGCGCAAGGAAGTCTTQCCCCACGCCTACGAGATrGCGAGGGA 

CCTGGCCGCGAAACCTCGGCTGTCGCTCGTGACGdrCAAGCGGCACCTGGTTCGCGACATCCGCCGA 

gaccttccggacgtcatcgagcgtgagctggagatocacggcatcaccttccatcacgacgag^^ 

aggaggcgcatcgagcagctgttcctctgaggcgcgcccctatgttgaacctgatcaacaaccacgc 

acacggttatgtggtcacgcccgtggtcctggcctgcaacgacgctggcctgt^ 

CAGGGACCGAAGGACTTCGACCGGTrGGCGGAQGCATrGCGTGCCAACCGGGGACATCTGCGCGTC 

gcgatgaggatgttcgaatcgctcggctgggttcgcAgcgacqcggatgacgtgtacgcggtgacg 

gcggcggcggccgcgcatcggtccttcccccgcgagqcgcagtcgctcttcgcgctgcccatggacc 

ggtacctgcgcggggaggacggcctgtccctggcgcqptggttcgagcgctctcgggcgtcgtggg 

ataccgatgacacgctggtgcgcgagctgctcgacggcgccatcatcacgccgctgatgctcgcgct 

ggagcagcgtgggggcctcaaggaggcgagccgtctotccgacctgtggtccgggggggatggaa 

gggacacgtgcgtccccgaggccgtccaacacgagctqgccgggttcttctccgcgcagaagtgga 

CGCGTGAGGACGCCGTCGACGCGGAGCTCACGCCCAAGjGGCGCCTTCATCTTCGAGCG^ 
TCTTCGCCATCGTCGGCTCGTACCGGCCGATGCTGGCCAGCATGCCGCAGCTGCTCTTCGGTGACTG 
CGACCAGGTCTTCGGGCGGGACGAAGCGGGCCACGAACTGCACCTGGACCGAACCCTCAACGTGAT 
TGGGAGCGGCCACCAGCACCGGAAGTACTTCGCGGAGCTlGGAGAAGCTCATCATCACCGTCTrCGA 

tgccgagaacctgtcggcacagccgcgctacatcgcggacatggggtgcggtgacggcacgctcct 
gaagcgggtgtatga.^cggtgcttcggcacacgcggc<5Aggaagggcgctcgaccggtttccgct 

CACGCTCATCGCCGCGGACTTCAACGAGAAGGCGCTCGAApCCGCTGGGCGGACCCTGGCCGGGTT 

ggagcacgttgccttgcgcgcggacgtggcgcggccggaccgtctcatcgaggacctgcgggcgcg 

CGGGCTAGGCGAGCCTGAGAATACGCTGCACATCCGCTCpmrCT^ 

CCTCCCGCGGACAGGGCGGGGCTCCACGCCCGGATTCCGTTCGATTCGGTGTTCGTGGGCAAGG 
GGCCAGGAGGTGGTTCCCGCGGAGGTGTTCCACAGCCTGGTfeGAGCACCTCGAG : 



